Fault-tolerant linear optics quantum computation by error-detecting quantum state 

transfer 



■ 

o ■ 
o : 

> ■ 

O: 
t> : 

m ■ 
> : 

m . 
r- ■ 
o ■ 
(N : 

\o ■ 

o : 

^ ■ 

a,: 

^— > ■ 
c ■ 

=3 : 
cr 



X 



Jaeyoon Cho 

Division of Advanced Technology, Korea Research Institute of Standards and Science, Daejeon 305-340, Korea 

(Dated: February 1, 2008) 

A scheme for linear optical implementation of fault-tolerant quantum computation is proposed, 
which is based on an error-detecting code. Each computational step is mediated by transfer of 
quantum information into an ancilla system embedding error-detection capability. Photons are 
assumed to be subjected to both photon loss and depolarization, and the threshold region of their 
strengths for scalable quantum computation is obtained, together with the amount of physical 
resources consumed. Compared to currently known results, the present scheme reduces the resource 
requirement, while yielding a comparable threshold region. 



I. INTRODUCTION 

One of the main obstacles to implementing a quantum 
computer using single-photon qubits is the lack of high 
nonlinearity between individual photons. Thanks to the 
development of the linear optics quantum computation 
(LOQC) scheme Q, we now believe that such a problem 
would be solved in terms of measurement-induced non- 
linearity. Recently the demanding requirements of the 
original LOQC scheme have been significantly reduced 
by importing the idea of one-way quantum computation 
(US El- In this context, much effort has been devoted to 
improving the key experimental techniques involved in it, 
such as generation of single-photon cluster states [f| @, 0] 
and storing single photons coherently in fiber loops for a 
period of time needed for feed- forward @, Q . 

Although the LOQC approach seems to be quite 
promising, we are remained with another essential re- 
quirement for the practical realization: an ability to 
cope with inevitable physical noise originated from de- 
coherence or imperfect operations. Fortunately, we are 
equipped with the ingenious th eory of fault-tolerant 
quantum computation [10|, [QL E Elf Q Gil- The main 
result of it is the threshold theorem which states that a 
scalable quantum computation can be performed with an 
arbitrary precision provided the noise strength is below 
a certain threshold. 

Applying the threshold theorem to the model of one- 
way quantum computation is not a trivial task. The 
main reason is that the threshold theorem was originally 
devised for the quantum circuit model. Recently, there 
have been reports on the existence of the fault-tolerant 
threshold for one-way quantum computation (l6j and also 
on the estimation of its value (l7l lla ] . 

Concerning LOQC, however, it is more difficult to ob- 
tain the threshold result, since additional effects due 
both to the non-deterministic nature of optical two-qubit 
gates and to photon loss have to be taken into account. 
In Ref. this issue was addressed and the fault- 

tolerant threshold was calculated by introducing an error- 
correction scheme tailored to LOQC. The basic idea was 
putting a layer bridging the physical level, wherein non- 
deterministic two-qubit gates are used, and the higher 



levels, wherein fault-tolerant quantum circuit computa- 
tion is performed as usual. In the bridging layer, such 
non-determinism of two-qubit operation is overcome by 
off-line preparation of ancilla cluster states, exploiting a 
massive amount of parallel operations and filtering pro- 
cesses. Although it is a remarkable result, the problem 
is that its resource requirement is extremely demanding, 
which leaves much room for improvement. 



In fact, most of the resources in the above scheme are 
consumed in the bridging layer, wherein two-qubit gates 
with a relatively low success probability have to be used 
so as to embed the error correction into the ancilla state 
and thus a massive amount of photons are filtered out. 
Therefore, if this layer is simplified by using a smaller 
code, the resource consumption might be significantly re- 
duced. This reasoning is the starting point of the present 
work. In this paper, a fault-tolerant LOQC scheme using 
an error-detecting code is proposed. The use of an error- 
detecting code and a consequent error-detecting scheme, 
which are much simpler than the error-correcting coun- 
terparts, results in the reduction of the resource con- 
sumption in many orders of magnitude. Remarkably, this 
is done without the expense of decreasing the tolerable 
noise level. 



The present work is on the same footing as recent inves- 
tigations of LOQC based on polarization-entangled single 
photon qubits, fusion gates, and one-way quantum com- 
putation 0, Ho] • The details of the physical model are 
described in Sec. |TTJ In Sec. QTI] a fault-tolerant quan- 
tum computation scheme using an error-detecting code 
is introduced. This scheme is similar in many respects to 
that of Ref. [l5| albeit a different error-detecing code is 
used and modifications are made to tailor it to LOQC. 
Sec. IIVI explains how this fault-tolerant quantum com- 
putation scheme is implemented with the resources de- 
scribed in Sec. [TTJ This LOQC scheme is simulated using 
the numerical method outlined in Sec. fVl with the noises 
described in Sec. [IT] being taken into account, and the 
numerical results of the error statistics are surveyed in 
Sec. EU Finally, the conclusion is given in Sec. IVII1 
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II. PHYSICAL MODEL 

A. Physical resources and elementary operations 

We are supposed to be given a sufficient amount of (a) 
two-photon polarization-entangled Bell states, (b) linear 
optical elements such as beam splitters and waveplatcs, 
(c) photodetectors resolving photon numbers, and (d) 
quantum memory gates that are used to store and re- 
trieve single photons without affecting the polarization 
states. We denote the computational basis states of a 
single photon qubit, or the eigenstates of Pauli operator 
Z, by |0) and 

The underlying idea is using these resources to gener- 
ate cluster states of single photon qubits and measuring 
individual photons of them to simulate quantum logic op- 
erations. In spite of the intrinsically probabilistic nature, 
a linear optical two-qubit gate can be used to build up a 
large-sized cluster state efficiently provided it works in a 
conclusive manner, i.e., it succeeds only with a probabil- 
ity less than one but whether the operation has succeeded 
or not can be detected. To build up a large-sized cluster 
state, one starts with combing resources (a), which are 
in fact two-qubit cluster states, into larger cluster states 
using two-qubit gates, and proceeds with combining suc- 
cessfully combined cluster states into more larger ones. 
This process thus requires a large amount of parallel op- 
erations and classical feed-forwards. 

The (type-I) fusion gate is well suited for this purpose 
It is implemented by mixing two photons on a po- 
larizing beam splitter and measuring one of the output 
ports in the basis of |±) = ^(|0) ± |1}), the eigenstates 

of Pauli operator X. In case two photons included in 
cluster states are input to a fusion gate, one and only 
one photon is detected with a probability 1/2, in which 
case, two-photon input states 1 00) and |11) are projected 
respectively into single-photon output states [0} and |1), 
resulting in combining the cluster states into a larger one. 
In other cases, wherein no or two photons are detected, 
both input photons are, in effect, measured in the Z ba- 
sis, resulting in simply removing the two photons from 
the cluster states. 



B. Noise model and assumptions 

As in the conventional theory of fault-tolerant quan- 
tum computation, the whole computational process is 
assumed to be split into unit time steps. In our case, 
one time step corresponds to the time taken by one mea- 
surement operation (including the ensuing classical feed- 
forward), which is the dominant time scale involved in the 
real experiment Q. Each memory gate, fusion gate, and 
measurement thus takes one time step, and Bell states 
are prepared before any desired time steps. At each time 
step, each photon undergoes two types of noise indepen- 
dently: photon loss and depolarization, whose respective 
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FIG. 1: Quantum circuit for error-detecting quantum state 
transfer. The inset shows a circuit for the generation of \+) L , 
the +1 eigenstate of the encoded Pauli operator Xl- 



strengths are parameterized by 7 and e. Each photon 
is lost independently with probability 7 after Bell state 
preparation, before a memory gate, before a fusion gate, 
and before measurement. Depolarization is simulated by 
applying randomly chosen Pauli operators (A, Z, and 
Y = XZ) at relevant locations: after Bell state prepa- 
ration and before a fusion gate, one of the 15 possible 
non-identity Pauli products is applied to the two pho- 
tons each with probability e/15; before a memory gate 
and before measurement, one of the 3 Pauli operators is 
applied to the photon each with probability e/3. We also 
make the conventional assumptions: any pair of photons 
can be input to a fusion gate at any time without ad- 
ditional overhead; any number of elementary operations 
can be performed in parallel; processing of classical infor- 
mation takes a sufficiently small time and is error-free. 



III. FAULT-TOLERANT QUANTUM 
COMPUTATION BY ERROR-DETECTING 
QUANTUM STATE TRANSFER 

A. Pauli frame 

Instead of implementing Pauli operators physically, we 
just keep track of a product of Pauli operators which 
should have been applied to the state. This Pauli prod- 
uct is called the Pauli frame [l5|, [2(J [HJ . The Pauli frame 
is updated after each operation according to the quantum 
circuit identities. This is possible since we use the Clif- 
ford gates that transform Pauli operators into Pauli op- 
erators under conjugation. In the same fashion, it is also 
possible to interpret measurement results with respect to 
the Pauli frame. In our case, all measurements involved 
in the error detection are performed in the X basis, thus 
the measurement value is flipped if the corresponding 
Pauli frame is Z or Y. The use of the Pauli frame and 
the appropriate measurement basis greatly reduces the 
amount of feed-forwards imposed in the original idea of 
one-way quantum computation. It is important because 
feed-forward is one of the main bottlenecks in LOQC. 



3 



B. Error-detecting code and encoded operations 

We use a 4-qubit stabilizer code, which is stabilized 
by three operators: Si — X1X2X3X4,, §2 — Z1Z2I3I4, 
and £3 = I1I2Z3Z4, where Xi and Zi are the Pauli 
operators and Ii is the identity operator acting on the 
ith qubit. Since the distance of this code is two, it 
detects one error. The encoded Pauli operators are 
chosen to be Z L = Z1I2Z3I4 and X L — XiX 2 l3l4- 
The corresponding encoded eigenstates are then given by 
|0) L = ^(10000) + 11111)), |1) L = ^(|0011) + 11100)), 

and \±) L = |(|00) ± |11))(|00) ± |11)). This choice sim- 
plifies the generation of an encoded state \+) L since it 
decomposes into two Bell states. Each Bell state is gen- 
erated by measuring the middle qubit of a linear three- 
qubit cluster state in the X basis, as shown in the in- 
set of Fig. [1] This is a transversal operation thanks 
to the symmetry of Bell states. Note that this process 
can be interpreted as generating a two-qubit cluster state 
-^j (| 0) |+) + |1) |— )) and performing an Hadamard oper- 
ation to the second qubit, where a |+) state, a CPHASE 
gate, and an A-basis measurement are regarded as con- 
stituting an Hadamard operation. At the level of encoded 
states, this is actually the simplest way of performing a 
transversal Hadamard operation. The encoded CPHASE 
operation is performed by four disjoint CPHASE oper- 
ations as J7(l,5)C/(2,7)J7(3,6)C/(4,8), where U(i,j) de- 
notes CPHASE between qubits i and j, and two encoded 
qubits are represented, respectively, by qubits 1 to 4 and 
5 to 8. The measurement of the encoded operator Xl or 
Zl is performed by measuring each of the four qubits in 
the same basis of X or Z. Note that these measurements 
give redundant information. For instance, the value of 
the -basis measurement is given either by the mea- 
surement of X1X2 or X3X4. This redundancy plays an 
important role in the code concatenation described later. 



C. Error-detecting quantum state transfer 

Fig. [T] depicts a quantum circuit for an encoded mem- 
ory gate. Two encoded ancilla qubits are first prepared 
in an encoded cluster state and then connected to an en- 
coded input qubit through an encoded CPHASE gate. 
We then measure both the input qubit and the next an- 
cilla qubit in the Xl basis. It is easily seen that this 
process simply transfers the encoded input state to the 
last encoded ancilla qubit up to the Pauli frame correc- 
tion [2J. An important insight into this process, which 
we will call the error-detecting quantum state transfer, 
is that it also embeds the syndrome information into the 
measurement results. Suppose first that the quantum 
circuit in Fig. [1] is error-free. If one of the input qubits 
has an X error, the corresponding value among those of 
measurements 5 to 8 is flipped, which leads to the par- 
ity of the four measurement values being odd. In the 
same manner, one Z error in the input leads to the par- 



ity for the measurement of X1X2X3X4 being odd. One 
can easily check that a Pauli error at any one location in 
the whole circuit is also indicated by the parity checks, 
or possibly by the next round of error-detecting quantum 
state transfer. The idea outlined here can be easily ex- 
tended in such a way that an encoded operation is also 
embedded into the circuit by preparing a different ancilla 
state. The details will be described later. 



D. Code concatenation 

We concatenate the encoding described above hierar- 
chically in such a way that four qubits at each level con- 
stitute one qubit at the next higher level. This hierarchy 
of concatenation ranges from level 0, at which a qubit 
is encoded in one photon, to level Id, at which a qubit 
is encoded in A ld photons. One objective of the present 
work is to obtain the statistics of errors occurring in each 
operation at each level. In the conventional theory using 
an error-correcting code, they are described in terms of 
the error rate, the probability that the operation yields 
a logically incorrect output, i.e., an output with errors 
that can not be corrected even by noiseless error correc- 
tion (in case the output has correctable errors, they are 
counted in obtaining the error rate of the next opera- 
tion). Error rates obtained in this way for every individ- 
ual operations at a certain level are used as parameters 
for obtaining error rates at the next higher level. For in- 
stance, let us consider a case of one-error-correcting code 
and denote the error rate at level by p. At level 1, 
the error rates behave as p 2 since every events occurring 
with probability of order p are those having an error at 
a single location, which can be corrected. In the same 
fashion, the error rates at level / behave as p 2 . They 
should decrease rapidly as long as the initial error rate 
p is sufficiently small; more precisely, as long as p is less 
than the fault-tolerant threshold. 

In our case, it is necessary to distinguish between two 
types of errors, namely, located and unlocated errors. If 
a set of Pauli errors and the locations at which they oc- 
curred is such that the embedded error-detecting process 
detects it, it is counted as a located error. Accordingly, 
the located error rate is defined as the probability of oc- 
currence of such errors. On the other hand, if the set con- 
tains Pauli errors and locations which are not detected by 
the embedded process while causing a logically incorrect 
output, it is counted as an unlocated error (detectable 
errors remaining in the output are counted in obtaining 
the error rates of the next operation). The unlocated 
error rate is defined rather differently as the probability 
of occurrence of an unlocated error given no located er- 
ror occurs. The reason it is defined as such a conditional 
probability is that ancilla states having located errors are 
all discarded from the computation. Suppose that the 
located and unlocated error rates at a certain level are 
respectively of order q and p. Those at the next higher 
level then behave respectively as 0(q) + 0(p) and 0(p 2 ). 
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The problem is that, as this estimation indicates, error 
detection does not decrease located error rates. Conse- 
quently, the previous scenario in which error correction 
is used does not work in our case. We overcome this 
difficulty by exploiting the redundancy of encoded Pauli 
measurements, explained in Sec. IIII Bl For example, if 
a locate error occurs during the CPHASE operation cor- 
responding to measurements 2 and 7 in Fig. [U instead 
of discarding the operation, we use the results of mea- 
surements 3, 4 and 5, 6 to obtain the values of the two 
X^-basis measurements (of course, in the ancilla prepa- 
ration step, no located error is allowed). By using this 
method, the located and unlocated error rates of the en- 
coded operation behave respectively as 0(p) + 0(q 2 ) and 
0(qp) + 0(p 2 ). For sufficiently small q and p, these error 
rates decay exponentially as the level of encoding gets 
higher, as in case of error correction. We will use both 
of the above decoding methods, and call the former the 
strong- detection mode and the latter the weak- detection 
mode. For brevity, we will also use the terms strongly- 
detected and weakly- detected operations. Note that al- 
though we classify each operation further into two types 
according to how the syndrome information is decoded, 
there is no difference in their physical implementation. 
The basic strategy is as follows: In the ancilla prepara- 
tion stage, we mainly use strongly-detected operations 
to reduce error rates as much as possible. If any located 
error occurs, the ancilla state is discarded and the prepa- 
ration is restarted. Once an ancilla state is successfully 
generated, we use weakly-detected CPHASE operations 
to connect it to an input state, in which way we reduce 
the chance of destroying the main computation. 



E. Universal set of quantum gates 

The universal quantum computation is guaranteed by 
bringing in the preparation of state \ir/8) — cos(7r/8) |0) + 
sin(7r/8) |1) which allows the implementation of the tt/8 
gate T = exp(— i^Z). In order to prepare the \tt/8) 
state, we first prepare a Bell state at the topmost level. 
We then measure one of the two qubits in the basis of 
{|tt/8) , |5tt/8}}, where |5tt/8) = ZX |vr/8) is the state 
orthogonal to the |7r/S) state. This measurement is done 
by measuring the corresponding lower-level qubits, re- 
spectively, in the bases of Z, Z, X, and { |7r/8) , |57r/8}}, 
the last of which is, in turn, done by measuring the lower- 
level qubits in the same way. As a result, the other qubit 
at the topmost level is remained in the |7r/8) state up 
to the Pauli frame correction. Note that errors occur- 
ring during the measurement could introduce an error at 
the topmost level, but they do not destroy the encoding 
structure. We can thus purify multi ple copies of noisy 
7r/8) states at the topmost level [23j. The error 

rates of this preparation is expected to be determined 
dominantly by those of measurements at low levels, which 
is confirmed by numerical simulation. The unlocated er- 
ror rate of the preparation of a \n/8) state is found to be 



well within the bounds for successful purification. 

IV. LINEAR OPTICAL IMPLEMENTATION 

A. Level-0 encoding 

Quantum state transfer at the bottom level relies on 
microclusters and parallel fusion as outlined in Ref. [20| . 
Quantum information is stored in the center node of a 
star-shaped cluster state, called a microcluster. In or- 
der to transfer the state of one center node, say qubit 
1, to another center node, say qubit 2, we apply fusion 
gates pairwise between the dangling nodes in parallel, 
and measure qubit 1 and all successfully fused nodes in 
the X basis. In case one or more fusion gates succeed, the 
state of qubit 1 is transferred to qubit 2 up to the Pauli 
frame correction. If the number of fused nodes is two or 
more, the ensuing measurements of them should give the 
same value, if not affected by noises. If it is not the case, 
a majority vote is taken over the values, and if it is not 
allowed, i.e., half of them are +1 and the rest are —1, we 
conclude qubit 2 has a located error. We also come to 
the same conclusion if any of the involved operations indi- 
cates a photon loss. If we ignore noise, the success prob- 
ability of the parallel fusion increases exponentially as 
the number of dangling nodes increases. More dangling 
nodes, however, introduce more noise and thus decrease 
the success probability asymptotically. In this paper, the 
number of dangling nodes of each microcluster is fixed as 
4. Note that, throughout this paper, we will always re- 
gard a fusion gate and the ensuing A-basis measurement 
as being performed in one time step, since it can be done 
by measuring both the output photons of the polarizing 
beam splitter in the X basis. The noise model for this 
combined operation will just follow the rules described in 
Sec. Ill Bi as if it is performed in two time steps. 

B. Level-1 encoding 

Fig. [H[a) depicts the cluster state to simulate a level- 
1 memory gate. Each circle represents a single-photon 
qubit. Fusion gates are applied in the order indicated 
by the numbers. The resulting cluster state is then com- 
posed of three columns, each representing a level-1 qubit, 
and the dangling nodes at both sides (leftmost nodes for 
input and rightmost nodes for output). It is easily seen 
that the quantum circuit simulated by this cluster state 
is equivalent to that of error-detecting quantum state 
transfer shown in Fig. [TJ All qubits in the three columns 
are measured in the X basis. Note that these measure- 
ments can be done even before an input state is fused (an 
analogous idea is used for the telecorrector introduced 
in Ref. Actually, each measurement is performed 

together with a preceding fusion gate in one time step 
as explained earlier. If any of the fusion gates or the 
measurements indicates a located error, this preparation 
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FIG. 2: Cluster states for the level-1 encoding: (a) a memory 
gate, (b) a Bell-state, and (c) a CPHASE gate. Fusion gates 
are applied in the order indicated by the numbers. 



stage is restarted. Otherwise, the resulting state is ac- 
cepted and finally fused with a level-1 input qubit on the 
left side. Note that the measurements in the rightmost 
column do not give information about the encoded state, 
since it is transferred to the rightmost dangling nodes. 

This idea can be easily extended to other operations. 
For an X-basis measurement, we construct the same clus- 
ter state without the rightmost dangling nodes. Note 
that in this case the measurement parity of the right- 
most column is not altered by the Pauli frame correction 
performed after fusing an input qubit. If this measure- 
ment parity is found to be odd, it means the ancilla state 
has an error and we thus restart the preparation stage. 
A CPHASE operation is performed in a similar way by 
constructing a cluster state composed of six columns of 
qubits, say columns 1 to 6, as shown in Fig. [2jc). In 
this case, columns 1 and 6 have dangling nodes for the 
input, while columns 3 and 4 have dangling nodes for 
the output. In case a CPHASE gate is followed by mea- 
surements, they are also embedded into the preparation 
stage. For example, when the first output qubit of a 
CPHASE gate should be measured, we construct a clus- 
ter state composed of six columns with dangling nodes 
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FIG. 3: Quantum circuit for a level-? memory gate (a) for 
2 < I < l c and (b) for I > l c . Each column represents one 
time step. 
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FIG. 4: Quantum circuit for a level-? CPHASE gate (a) for 
2 < I < l c and (b) for I > l c . Each gate depicted with thick 
lines is an abbreviation of the corresponding four level-(7 — 1) 
gates. Each column represents one time step. 



attached to columns 1, 4, and 6, and the measurement 
parity of column 3 is used to filter out noisy ancilla states. 
Another useful operation is the Bell-state preparation 
shown in Fig. Hlb), which are used for the preparation 
of a level- 2 |+) state. The resulting state is accepted 
unless any photon loss is detected or the measurement 
parity of the middle column is found to be odd. 

Remarkably, once appropriate cluster states are gener- 
ated (with all qubits being measured except at the dan- 
gling nodes), every level-1 gate operations are performed 
in one time step by fusing and measuring dangling nodes 
in parallel. Level-1 gates can be thus treated as if they 
are unit gates occupying one time step in a level-2 quan- 
tum circuit with a particular located and unlocated error 
rate. Therefore, once the error rates of individual level-1 
gates are calculated for given level-0 noise rates 7 and 
e, the problem reduces to that of determining whether 
the set of level-1 error rates is within the fault-tolerant 
threshold of the conventional quantum circuit model. 



C. Second and higher levels of encoding 

From the second level of encoding, each gate opera- 
tion is performed by directly implementing the quantum 
circuit shown in Fig. [1] or its variant. A normal way 
of doing this would be to arrange involved lower-level 
gates consecutively along the time axis in a way as de- 
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picted in Fig. EJb) for a memory gate or Fig. HJb) for 
a CPHASE gate. This method, however, requires error 
rates lower than can be achieved by our level-1 gates. 
This is the case even if the physical noises are neglected 
(7 = e = 0), owing to the probabilistic nature of the fu- 
sion gate. In such a case, the level-0 state transfer fails 
with probability 0.5 4 = 0.0625, as we use four dangling 
nodes per qubit, and this gives the level-1 located error 
rate of about 0.0215. This value is higher than the lo- 
cated error rate required for the memory gate shown in 
Fig. [IJb) to work correctly, which is about 0.0210 even if 
the unlocated errors are all neglected. 

We overcome this difficulty by introducing delayed fu- 
sion, which exploits the fact that two qubits can be in- 
put to a CPHASE gate at different time steps. For in- 
stance, we modify the quantum circuit for a memory 
gate as shown in Fig. [3]Ja), and in the same fashion a 
CPHASE gate as shown in Fig. 01a). For a memory 
gate, we first implement the ancilla part up to the half 
of the final gates (CPHASE plus measurement), corre- 
sponding to measurements 5 to 8, while the input dan- 
gling nodes at the other half are left untouched. Only 
when this preparation stage is successful, an input qubit 
is finally fused, during which the output dangling nodes 
are left untouched. Compared to the circuit in Fig.^b), 
this method puts more larger parts into the preparation 
stage so that located errors on them can be also filtered 
out. As a result, the located error rate of the encoded 
gate is much reduced. However, this also introduces idle 
time steps of dangling nodes causing additional level-0 
memory noises. Moreover, these noises accumulate as 
the level of encoding gets higher. That is, a level-2 gate 
uses level-1 gates delayed one time step, but a level-3 gate 
uses level-2 gates delayed one time step which in turn use 
level-1 gates delayed two time steps. After the accumu- 
lated noise exceeds a certain bound, additional levels of 
concatenation will make things worse rather than better. 
Therefore, we use this method only until we reach the 
l c th level of encoding in which we get sufficiently low er- 
ror rates, and from the (l c + l)th level, we proceed with 
the implementation normally without the delayed fusion 
as in Fig. |3fb) and Fig. Hps). 



V. NUMERICAL SIMULATION 

Error rates are calculated numerically using the Monte 
Carlo simulation [2(J. For each iteration of the simula- 
tion, randomly chosen errors are assigned to every in- 
volved gates according to the error rates, and the result- 
ing error in the encoded output qubit is identified. In so 
doing, we do not have to care about the full state of the 
involved qubits, since our concern is the error statistics 
that can be completely predicted just by recording the er- 
rors together with the locations at which they occurred. 
The details can be best illustrated by considering several 
examples. 

Let us first consider the strongly-detected level-1 mem- 
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FIG. 5: Cluster state equivalent to that generated during the 
level-1 memory operation. Every qubits represented by filled 
circles are measured in the X basis. 

ory gate. It is implemented by preparing an ancilla state 
as shown in Fig.[2](a), which involves sixteen star-shaped 
five-qubit cluster states. Each of them is generated by 
fusing middle qubits of three-qubit linear cluster states, 
each of which is in turn generated by fusing two-qubit 
cluster states. Note that in doing this bottom-up con- 
struction, all involved states are cluster states (with cor- 
responding Pauli frames). Any cluster state with a Pauli 
error (i.e., an error given by a product of Pauli operators) 
can be transformed to a noiseless state with an incorrect 
Pauli frame having an error composed of only Z oper- 
ators. It is easily seen by observing a quantum circuit 
representing such a noisy cluster state composed of |+) 
states, CPHASE gates, and Pauli gates representing the 
Pauli frame and the Pauli errors. From the quantum cir- 
cuit identities, the order of a CPHASE and an X gate 
can be exchanged by adding a Z gate, whereas the order 
of a CHASE and a Z gate can be freely exchanged. By 
using these properties, every X errors can be put next to 
the |+) states, and using X |+) = |+), we are remained 
with an error composed of only Z operators. Note that a 
phase factor — 1 arising in exchanging the order of X and 
Z is a global phase which has no practical meaning. To 
simulate a noisy fusion gate, we first apply Pauli errors 
randomly according to the rule in Sec. Ill Bl transform all 
errors to Z errors in Pauli frames, and apply a noiseless 
fusion gate. Considering only Z errors in Pauli frames, a 
noiseless fusion gate works in such a way that the output 
qubit has a Z error only in case either of input qubits has 
a Z error while the other has no error. Noisy measure- 
ments are simulated in a similar way: since all qubits 
are measured in the X basis, a measurement yields an 
incorrect result only in case the qubit has a Z error in 
the Pauli frame. Once a noisy cluster state for a level-1 
memory is generated in this way and if no located error 
is detected in it, it is fused with an input state having 4 
qubits and 16 dangling nodes attached to them, and we 
proceed as follows. 

(1) If any of the four parallel fusions between the input 
and the ancilla indicates a located error, this memory 
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gate is counted as having a located error. Otherwise, we 
proceed to the next step. 

(2) The state is now equivalent to the cluster state 
shown in Fig. [5] with every qubits represented by filled 
circles being measured in the X basis. The next step is 
to simulate the propagation of errors from the input to 
the ancilla state. For example, if qubit 1 in Fig. [5] has a 
Z error, it is propagated to qubit 5. Although such an 
error also induce an X error at the fused node between 
qubits 1 and 5, it has no effect as the node is measured 
in the X basis. In the same manner, if the fused node 
between qubits 2 and 6 has a Z error, it is propagated to 
qubit 11. 

(3) The syndrome measurement for qubits 5 to 8 is 
examined. If the number of Z errors among them is one 
or three, the error is detected, thus this memory gate is 
counted as having a located error. On the other hand, if 
the number of Z errors is two and if one of them exists 
at either qubit 5 or 6 while the other at either qubit 7 or 
8, the error flips the value of this X^-basis measurement, 
resulting in incorrect update of the Pauli frame. This is 
reflected by applying Z errors to qubits 13 and 15. 

(4) The syndrome measurement for qubits 9 to 12 is 
examined in the same way as in step (3). If the error is 
detected, this memory gate is counted as having a located 
error. On the other hand, if the error flips the value 
of this Xf,-basis measurement, instead of pretending the 
incorrect Pauli update as in step (3), we just conclude 
this memory gate has an unlocated X error (note that 
we have performed a transformation so that there are no 
more X errors in the ancilla state). We do not pretend 
the incorrect Pauli frame update, since it is caused by 
the error that has been already counted. 

(5) The Z errors at qubits 13 to 16 are examined. If the 
number of Z errors is two and if they exist respectively at 
qubits 13, 14 and qubits 15, 16, we conclude this memory 
gate has an unlocated Z error, and this unlocated error 
is removed from the state, since it has been counted. 

(6) If this memory gate has both unlocated X and 
Z errors, the error is counted as an unlocated Y error 
instead. The remaining level-1 output qubit, i.e., qubits 
13 to 16 and the attached dangling nodes, is used as an 
input state for the next round of the simulation. 

By following these steps, one sample of the Monte 
Carlo simulation is yielded. A remaining problem is, 
however, that the error rates depend on the errors in 
the input state as well. A standard treatment is apply- 
ing memory gates successively to a noiseless initial state 
until the transient effect washes out, and taking the next 
memory gate to quantify the error statistics [3, [2(| ■ Ac- 
tually, such a transient effect rapidly washes out in our 
case as well. It can be shown numerically that after one 
memory operation, each successive memory gate exhibits 
the same error rates. We thus repeat applying a level- 
1 memory gate to a noiseless input state until the gate 
yields no located error and take the output of it as an in- 
put for another level-1 memory gate to obtain one sample 
of the simulation. In fact, at the second level of encoding, 



level-1 memory gates are used not after memory gates but 
after CPHASE gates as shown in Fig. [3J but that does 
not make substantial changes in the error rates. 

The error rates of the level-1 Bell-state preparation 
shown in Fig. [2jb) are also obtained in the same man- 
ner. In this case, we use the fact that a Bell state is an 
eigenstate of X1X2 and Z1Z2 to transform all unlocated 
errors into single-qubit errors. For example, X1Z2, Y1Z2, 
and Z\ errors are respectively equivalent to Y2, X2, and 
Z2 errors, and so forth. Concerning the level-1 CPHASE 
gate shown in Fig. HJc) , we simplify the simulation by us- 
ing a numerically confirmed fact that the errors occur as 
if level-1 memory errors independently affect two input 
qubits and an error- free level-1 CPHASE gate follows, 
making the errors propagate accordingly in such a way 
that an X error in one qubit induces a Z error in the 
other qubit. It is also confirmed that the same property 
holds for all other level-1 gates embedding a CPHASE 
gate. For example, to simulated a level-1 CPHASE-plus- 
measurement gate included in the circuit of Fig. [3Ja), we 
apply the errors of the level-1 measurement and the one- 
step-delayed level-1 measurement independently to each 
qubit according to the corresponding error rates, and 
apply an error-free CPHASE-plus-measurement gate af- 
terward, making the errors propagate accordingly. Note 
that an n-step-delayed measuremet is always performed 
to an n-step-delayed qubit for any n, which has to be 
correctly reflected in the simulation by applying addi- 
tional level-0 noises to the dangling nodes. To sum up, 
we just obtain the single-qubit error rates for the Bell- 
state preparation, memory, measurement, and each n- 
step-delayed measurement, and in case a CPHASE oper- 
ation is involved, the error rates are obtained by using the 
single-qubit error rates and the error propagation rule. 
Note also that, in Fig. [3] or Fig. H the CPHASE gates 
next to the Bell-state preparation exhibit different error 
rates since the inputs are not the outputs of memory 
gates. This does not arise as a problem since the error 
rates of them are in fact smaller than those we use, thus 
we will not underestimate the error rates. 

From the second level of encoding, obtaining the error 
rates is simpler and consumes less time because we now 
simulate the errors in the conventional quantum circuit 
model having a much smaller number of error locations. 
The underlying method is the same as that used in the 
level-1 simluations: we obtain the single-qubit error rates 
and use the error propagation rule to obtain the error 
rates of two-qubit gates. 



VI. FAULT-TOLERANT THRESHOLD AND 
RESOURCE CONSUMPTION 

The located and unlocated error rates of individual 
operations are calculated starting from a given set of pa- 
rameters {7, e}, and when the error rates asymptotically 
go to zero as the level of encoding gets higher, the given 
set is said to be within the threshold region. In our case, 
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finding out such an asymptotic behavior involves exhaus- 
tive calculations since we have to obtain the error rates 
of every n-step-delayed measurements and moreover the 
asymptotic behavior depends on which level is chosen as 
l c [see Sec. IIV C| . For simplicity, we just fix l c = 5 and 
define one particular sufficient condition as our fault- 
tolerance criterion: we say the set {7, e} is within the 
threshold region if the error rates of the level-5 opera- 
tions satisfy the following conditions: 

max{Q M ,Q s } < 1(T 2 , (1) 

max{Pf ,P#, 100P#} < 1(T 6 , (2) 

max{P| f , Pf , 100P^} < 10~ 4 , (3) 

max{P|, Pf, 100P#} < 10" 4 , (4) 

where Q and P denote, respectively, located and unlo- 
cated error rates, subscripts X, Y, and Z denote the 
types of the unlocated errors, and superscripts B, M, 
and S denote, respectively, Bell-state preparation, mem- 
ory, and measurement. The required error rates for un- 
located Y errors are chosen as smaller values because 
they are actually found to be much smaller than those 
of the unlocated X and Z errors. Provided these condi- 
tions are met, the error rates at the higher levels can be 
reduced asymptotically to zero using the weak-detection 
mode without the need of delayed fusion, as shown in 
Fig-EIb) and Fig.2£b). Note again that these conditions 
are not necessary ones for the fault tolerance. They are 
chosen just for convenience so as to prevent exhaustive 
numerical works. 

The error rates and the resource consumption also 
vary according to how the strong- and the weak-detection 
modes are incorporated in code concatenation. Here, we 
consider two methods. In the first method, the ancilla 
part of a level-/ operation is implemented with strongly- 
detected level-(7 — 1) operations, while the remaining part 
is implemented with weakly-detected level- (I — 1) opera- 
tions. The second method is the same as the first method 
except the ancilla parts of lcvel-2 operations are imple- 
mented with weakly-detected level-1 operations. Com- 
pared to the first method, this method needs less re- 
sources for the same level of concatenation, but leads 
to higher error rates. 

The threshold regions for the first method (solid curve) 
and the second method (dotted curve) are shown in 
Fig. [51 To obtain these curves, we first determine the 
threshold value of 7 with fixing e — 0, which is found to 
be 1.57 x 10 -3 . This does not require much computa- 
tional work, since we have to take into account only pho- 
ton losses at the dangling nodes of level-0 cluster states 
and located error rates at higher levels can be easily de- 
rived from those of the level-0 operations. In this case, 
the unlocated error rate is zero. Once the range of 7 is 
determined, we start guessing the threshold value of e 
with fixing 7 as particular values, corresponding to dots 
in Fig. [6] Near the threshold value, e is varied by a small 
increment of Ae = 1CP 6 , to which extent the threshold 
value is determined. The number of samples for each 
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FIG. 6: Threshold region for scalable quantum computation 
using method 1 (solid curve) and method 2 (dotted curve), 
where 7 and e denote, respectively, the probabilities of photon 
loss and depolarization per operation. 
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FIG. 7: Located error rate Qm of a memory gate (left) and 
average number of two-photon Bell states consumed for it 
(right) with respect to the level of encoding. 

Monte Carlo simulation is chosen as 10 7 at the level-0 
encoding and 10 8 at higher levels. Note that the results 
in Fig. [5] are comparable to those in Ref. [2(| using error- 
correcting codes. 

In order to compare the performance further, we plot 
in Fig. [7] the located error rates of a memory gate (left) 
and the resource consumption for it in terms of the aver- 
age number of two-photon Bell states used (right) with 
respect to the level of encoding for the first method (solid 
curve) and the second method (dotted curve) with two 
sets of parameters: 7 — lOe = 4 x 10~ 4 (unfilled squares) 
and 7 = lOe = 10~ 4 (filled squares). In these simulations, 
l c has been chosen larger than 7, that is, we used delayed 
measurements at every levels simulated. Note that the 
curves are obtained with the help of analytic calculations, 
which are allowed at high levels of encoding wherein lo- 
cated error rates dominate all unlocated error rates and 
the unlocated error rates of weakly-detected operations 
dominate those of strongly-detected operations. In such a 
limiting case, only errors in delayed measurements make 
dominant contributions to the error rates at the next 
higher level, since the weak-detection mode is used only 
for delayed measurements. Consequently, we can easily 
calculate error rates by taking into account only four er- 
ror locations associated with them. As located error rates 
dominate unlocated error rates, they indicate how many 
operations can be performed reliably. Compared to the 
results of Ref. 20], our scheme is found to require less 



9 



resources in general. In case of 7 = lOe = 4x 10~ 4 , we 
obtain the error rate of 10~ 9 using about 4 x 10 18 Bell 
pairs (the first method) , while the scheme in Ref . [2(| re- 
quires about 10 23 Bell pairs. The required resources are 
drastically reduced as the noise strength decreases: in 
case of 7 = lOe = 10 -4 , the same error rate is attained 
using about 10 13 Bell pairs (the second method). 

VII. CONCLUSION 

In summary, a new scheme for fault-tolerant linear op- 
tics quantum computation based on an error-detecting 
code has been introduced and analyzed with most of the 
prominent obstacles encountered in linear optics quan- 
tum computation, such as non-deterministic two-qubit 
operation and photon loss as well as imperfect gate oper- 
ations and decoherence, being taken into account. In or- 
der to deal with them, various established techniques and 
some new ideas were incorporated. The basic building 
block is the error-detecting quantum state transfer. At 
the level of encoded qubits, its role is simply transferring 
quantum information through a linear cluster state, but 
useful insight is that the intermediate measurements also 
give the syndrome information for detecting errors. The 
error-detecting code and encoded operations were con- 



catenated using two types of decoding methods, namely 
the strong- and weak-detection modes, and introducing 
delayed fusion operation. It was shown that the rate of 
error per gate operation can be substantially lowered by 
the concatenation and both the fault-tolerant region of 
initial noise rates and the resource consumption were cal- 
culated numerically. The numerical results indicate that 
the resource consumption is decreased in many orders 
of magnitude compared to the known results. Although 
the results exhibit great improvements, the resource re- 
quirement is still demanding and far from practical im- 
plementation. It might be further reduced by importing 
other techniques such as purification or error correction. 
In order to make the reduction to an extent that allows 
practical linear optics quantum computation, however, 
it seems that new ideas will be required to substantially 
overcome the non-deterministic nature of two-qubit op- 
eration. 
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